A Text Mining Approach for Definition Question Answering
نویسندگان
چکیده
This paper describes a method for definition question answering based on the use of surface text patterns. The method is specially suited to answer questions about person’s positions and acronym’s descriptions. It considers two main steps. First, it applies a sequence-mining algorithm to discover a set of definition-related text patterns from the Web. Then, using these patterns, it extracts a collection of concept-description pairs from a target document database, and applies the sequence-mining algorithm to determine the most adequate answer to a given question. Experimental results on the Spanish CLEF 2005 data set indicate that this method can be a practical solution for answering this kind of definition questions, reaching a precision as high as 84%.
منابع مشابه
Using Machine Learning and Text Mining in Question Answering
This paper describes a QA system centered in a full data-driven architecture. It applies machine learning and text mining techniques to identify the most probable answers to factoid and definition questions respectively. Its major quality is that it mainly relies on the use of lexical information and avoids applying any complex language processing resources such as named entity classifiers, par...
متن کاملINAOE at CLEF 2006: Experiments in Spanish Question Answering
This paper describes the system developed by the Language Technologies Lab at INAOE for the Spanish Question Answering task at CLEF 2006. The presented system is centered in a full datadriven architecture that uses machine learning and text mining techniques to identify the most probable answers to factoid and definition questions respectively. Its major quality is that it mainly relies on the ...
متن کاملText Mining in Biograph
The Biograph project is a biomedical knowledge discovery project combining graph data mining with structured biomedical information and with text mining on medline abstracts. It is a cooperation between the molecular genetics, data mining, and computational linguistics research groups of the University of Antwerp. In this talk, I will outline the general architecture of the system, which is cur...
متن کاملSemantic Content Access Using Domain-Independent NLP Ontologies
We present a lightweight, user-centred approach for document navigation and analysis that is based on an ontology of text mining results. This allows us to bring the result of existing text mining pipelines directly to end users. Our approach is domain-independent and relies on existing NLP analysis tasks such as automatic multi-document summarization, clustering, question-answering, and opinio...
متن کاملMining Paraphrasal Typed Templates from a Plain Text Corpus
Finding paraphrases in text is an important task with implications for generation, summarization and question answering, among other applications. Of particular interest to those applications is the specific formulation of the task where the paraphrases are templated, which provides an easy way to lexicalize one message in multiple ways by simply plugging in the relevant entities. Previous work...
متن کامل